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56 265 S4 AND VACCIN? 

9 
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HE ART WATER AND VACCINE 


S2 
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RD SI (unique items) 


S3 


206 


S2 AND PY<2000 


S4 


3 


S3 AND (MAP2 OR MAP(W)2 OR MAJOR (W) ANTIGENIC (W) PROTEIN) 


S5 


115 


(MAP2 OR MAP{W)2 OR MAJOR (W) ANTIGENIC (W) PROTEIN) AND RUMIN- 




ANTIUM 


S6 


42 


RD S5 (unique items) 
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RESULT 5 

COWIMMDOM 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



sig^peptide 



CDS 



sig_peptide 



ORIGIN 



COWIMMDOM 2773 bp DNA linear BCT 06-OCT-1994 

cowdria ruminanti urn (clone pF5.2) immunodominant protein, complete 
cds; ORF, complete cds. 
L07385 

L07385.1 Gl:289922 
immunodominant protein. 

Ehrlichia ruminanti urn (heartwater rickettsia) 
Ehrlichia ruminanti urn 

Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsials; 
Anapl asmataceae ; Eh rl i chi a . 
1 (bases 1 to 2773) 

Mahan,S.M., McGui re,T.C. , Semu,S.M., Bowie, M. v., Jongejan, F. , 
Rurangirwa,F.R. and Barbet,A.F. 

Molecular cloning of a gene encoding the immunogenic 21 kDa protein 
of Cowdria ruminanti urn 

Microbiology 140 (Pt 8), 2135-2142 (1994) 
7921263 

Original source text: Cowdria ruminantium (library: puci3 of Suman 
Mahan) dna. 

Locati on/Qual i f i ers 

1. .2773 

/organi sm=" Ehrl i chi a rumi nanti urn" 
/moT_type="genomic dna" 
/db_xref="taxon:779" 
/tissue_lib="puci3 of Suman Mahan" 
1245. .1874 

/experiment="experimental evidence, no additional details 

recorded" 

/codon_start=l 

/transl_table=ll 

/product="immunodominant protein" 
/protei n_i d=" AAA50280 . 1" 
/db_xref="Gl: 289923" 

/trans! ation="MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNESDKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLTH FVPDLKSKEPQVDKLLSLIKQYL" 
1245. .1301 
/note="putati ve" 
1943. .2773 
/note="putative" 
/codon_start=l 
/transl_table=ll 
/protei n_i d="AAA50281 . 1" 
/db_xref="Gl: 289924" 

/translation="MKKFLLVLFLLLVMLPKDSNAEHIHWGSSTAFPFIAAIAEEFG 
RFSDYGTPIIESVGSGMGFSMFCQSVENSTPDIAMSSRKIKDAEVELCKSNDVHDIIE 
IIIGYDGIVIANSNNSNKLDFTKKDLFKALSKYATSEEYTHSIPVNDFKYWSEINNRF 
PNIDIEVYGPYKNTGTYNILIEEIMQDSCMNHKNFIEVYPDLKKRQHACSMIRNDGKY 
IEVAANENIIIQKIAKNNAAFGIFSFSFLIQNQDKIHGNKIAGVEPTYETISSGKYIL 
S" 

1943. .2005 
/note="putati ve" 



Query Match 100.0%; score 618; DB 15; 

Best Local Similarity 100.0%; Pred. no. 4e-183; 
Matches 618; Conservative 0; Mismatches 0; 



Length 2773; 
indels 0; Gaps 



0; 



Qy 1 ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I I I I I GGGA 60 

i I 1 i II II Ml II 1 I i II I I II 111 I illl II 111 Ml II i 1 II II li I 1 II II 1 i l I I I 

Db 1257 ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I 1 I I I GGGA 1316 



Qy 



6 1 T ATTCTTACATAACA AAAC A AGGTATATTCCAACC AAAATTACACG ACTCTCCTG ATGTT 

I M I M M MM M I MM M M M MMM MM MMM MM II M MMMM I M 



120 



AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 

FEATURES 

source 



CDS 



Bowie, M.v. , Dongejan,F. , 

immunogenic 21 kDa protein 



sig_peptide 



CDS 



sig_peptide 

ORIGIN 



Mahan, S.M., McGui re,T.C. , Semu,S.M 
Rurangi rwa, F. R. and Barbet,A.F. 
Molecular cloning of a gene encoding the 
of Cowdria ruminantium ~ ^ . 

Microbiology 140 (Pt 8), 2135-2142 (1994) 
7921263 

Original source text: Cowdria ruminantium (library: pUCl3 of Suman 
Mahan) DNA. 

Locat i on/Qua! i f i er s 

1. .2773 

/orqanism=" Ehrlichia ruminantium" 
/moT_type="genomic DNA" 
/db_xref="taxon:779" 
/tissue_lib="puci3 of Suman Mahan 1 ' 
1245. .1874 

/experiment="experimental evidence, no additional details 

recorded" 

/codon_start=l 

/transl_table=ll 

/product="immunodominant protein" 
/protei n_i d="AAA50280 . 1" 
/db_xref="Gl: 289923" 

/transl ation="MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNESDKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLTHFVPDLKSKEPQVDKLLSLIKQYL" 
1245. .1301 
/note="putative" 
1943. .2773 
/note="putative" 
/codon_start=l 
/transl_table=ll 
/protein_id="AAA50281.1" 
/db_xref="Gl: 289924" 

/transl ation="MKKFLLVLFLLLVMLPKDSNAEHIHVVGSSTAFPFIAAIAEEFG 
RFSDYGTPIIESVGSGMGFSMFCQSVENSTPDIAMSSRKIKDAEVELCKSNDVHDIIE 
IIIGYDGIVIANSNNSNKLDFTKKDLFKALSKYATSEEYTHSIPVNDFKYWSEINNRF 
PNIDIEVYGPYKNTGTYNILIEEIMQDSCMNHKNFIEVYPDLKKRQHACSMIRNDGKY 
IEVAANENIIIQKIAKNNAAFGIFSFSFLIQNQDKIHGNKIAGVEPTYETISSGKYIL 
S" 

1943. .2005 
/note="putative" 



Query Match 100.0%; Score 618; DB 15; 

Best Local Similarity 100.0%; Pred. No. 4e-183; 
Matches 618; conservative 0; Mismatches 0; 



Length 2773; 
indels 0; Gaps 



0; 



Qy 1 ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I I I I I GGGA 60 

I I I 1 I 1 I 1 I I I I 1 I I I 1 I I I I I I I I I I I 1 I I 1 I I 1 I I I 1 I 1 I I 1 1 I I I I I I I I I I I I I I 1 
Db 1257 ATG AAG G CTATCAAGTTTATACTTAATCTATGTTTACTATTTG C AGCAA I 1 I I I 1 I GGGA 1316 



QY 
Db 

Qy 

Db 



61 TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAATTACACGACTCTCCTGATGTT 120 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I ! I I I I I I I I I I I I I I I I I I I I I I M I 

1317 TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAATTACACGACTCTCCTGATGTT 1376 



121 



180 



AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATT 

I I II 1 1 II 1 1! II II II III Mill III II IIMI1IMI IMIIIIII III MM II 1 1 

1377 AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATT 1436 



Qy 

Db 

Qy 

Db 



181 ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCC I I I I I GGGTTTTCTTCT 240 

I I M M I M M I I 1 1 I M 1 1 1 1 Ml 1 1 1 1 1 I I M I i 1 1 M I I M 1 1 1 1 I 1 1 1 M IM I I I 

1437 ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCC I I I I I GGGTTTTCTTCT 1496 
241 TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 300 

1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

1497 TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 1556 



Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 
Qy 

Db 

Qy 

Db 



301 A ACG AATCTG ACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGT A 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I M I II I I I I I I I I I M I I I I II I 
1557 AACGAATCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 



361 
1617 
421 



GAAACACTAAAAGAGTTTCACAAAAATTTTGACTCACGGATTCAAATGTTAACAGGAAAC 

I I I M I I I I I I M I I I I I I M II II . Mill II II I I I I I M I I I I I I I I I It I I I I M , 

GAAACACTAAAAGAGTTTCACAAAAATTTTGACTCACGGATTCAAATGTTAACAGGAAAC 



360 
1616 
420 
1676 
480 



ATTGAAGCTATTAATCAAATAGTACAAGGGTACAAAGTATATGTAGGTCAGCCAGACAAT 
I I I I I I II I I I M I I I M I I I I I M I I I I I I I | | M I I I I I I I I I I I I I I I I I I I I I I I I 
1677 ATTGAAGCTATTAATCAAATAGTACAAGGGTACAAAGTATATGTAGGTCAGCCAGACAAT 1736 



481 



540 



GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGACAAGAAAGGAGAATATTTA 
i I I ! I I I I ! I I I I I I II I I I I I I II I I I I I i I I I I I I I I I I I , I I I I I I I I I I I I I I i! : 
1737 GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGACAAGAAAGGAGAATATTTA 1796 



541 



600 



ACACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTGGATAAATTACTTTCTTTA 

I I MM III Mil II Ml II I I III I III I Ml I III III II Ml I I II I MM I II II I 

1797 ACACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTGGATAAATTACTTTCTTTA 1856 



601 



618 



ATTAAGCAGTATCTTTAA 
I M I I I I I I M I I I I I I I 
1857 ATTAAGCAGTATCTTTAA 1874 



RESULT 6 
AF117728 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



AF117728 

Cowdria ruminantium 
gene, complete cds. 
AF117728 

AF117728.1 Gl: 5163240 



CDS 



630 bp 
isolate Palm 



DNA linear BCT 26-JUL-2000 
River major antigenic protein 2 



(heartwater rickettsia) 

Al phaproteobacteri a ; Ri ckettsi al es ; 



ORIGIN 



Ehrlichia ruminantium 
Ehrlichia ruminantium 
Bacteri a ; Proteobacteri a ; 
Anaplasmataceae; Ehrli chia 

1 (bases 1 to 630) 

Bowie, M. v., Reddy,G.R., Semu,S.M., Mahan,S.M. and Barbet,A.F. 
Potential value of major antigenic protein 2 for serological 
diagnosis of heartwater and related ehrli chial infections 
Clin. Diagn. Lab. Immunol. 6 (2), 209-215 (1999) 
10066656 

2 (bases 1 to 630) 

Bowi e , M . v . , Reddy , G . R . , Semu , S . M . , Mahan , s . M . and Barbet , A . F . 
Direct Submission 

Submitted (05- JAN-1999) Pathobiology , University of Florida, 2015 
SW 16th Avenue, Building 1017, Room V2-162, Gainesville, fl 32610, 
USA 

Locati on/Qua! i f i ers 
1. .630 

/organi sm=" Ehrl i chi a rumi nanti urn" 
/mol_type="genomic DNA" 
/i sol ate=" Palm River" 
/db_xref="taxon:779" 
1. .630 

/codon_start=l 
/transl_table=ll 

/product="maior antigenic protein 2" 
/protein_i d= AAD40617 . 1" 
/db_xref="GI: 5163241" 

/trans! ation= u MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNEADKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLAHFVPDLKSKEPQVDKLLSLIKQYL" 



ORIGIN 



NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNESDKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLTHFVPDLKSKEPQVDKLLSLIKQYL" 



Query Match 100.0%; Score 618; DB 15; Length 630; 

Best Local Similarity 100.0%; Pred. No. 5.3e-183; 

Matches 618; conservative 0; Mismatches 0; indels 0; Gaps 

ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA 1 I I I I 1 I GGGA 
I I I I I i 1 I I I I I I I I I I I I I I I I I I II I 1 I 1 1 I I I I I I 1 I I I I I I I I I I I I I I I I I I 11 I 
ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I I I I I GGGA 

TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAATTACACGACTCTCCTGATGTT 

III II I MM I MM III M Mill MM Mill 1 1 III I Mil II II I Mill III II I 

TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAATTACACGACTCTCCTGATGTT 
AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATT 

I II III II Ml MM III II IMIIIIIIIIIMI I MM III I II II I II II I IMIII 

AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATT 



Qy 


1 

1 


nk 
DD 


Id 


Qy 


bl 


nk 
UD 


73 
/ 5 


Qy 


1^1 


DD 


1 3 3 

155 


Qy 


lol 


DD 


1 Q3 


Qy 


241 


nk 
DD 


O C 3 


Qy 


5U1 


nk 
DD 


old 


Qy 


ibl 


DD 


t> 7 t> 


Qy 


421 


Db 


433 


Qy 


481 


Db 


493 


Qy 


541 


Db 


553 


Qy 


601 


Db 


613 



TTGACTCACGGATTCAAATGTTAACAGGAAAC 

II I III I HIM Mill MM I Mill Mill I 

GAAACACTAAAAGAGTTTCACAAAAATTTTGACTCACGGATTCAAATGTTAACAGGAAAC 



GAAACACTAAAAGAGTTTCACAAAAAT 

II II M M II II M I I M I I II I 



0; 



60 
72 



120 
132. 
180 
192 
240 



ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCC I I I I I GGGTTTTCTTCT 

I II II 1 1 1 1 1 1 1 II I II II I I II 1 1 1 1 II I II 1 1 1 II II 1 1 II 1 1 M I I I II II 1 1 II 1 1 

ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCC I I I I I GGGTTTTCTTCT 252 



TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 

I 1 1 1 1 1 1 1 I i i i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i I 1 1 1 1 1 I 

TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 
AACGAATCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 

IIMIIIIIIIIIIIMIIIIIIIIIIIIIIIMIIIMIIIIIIIIIIIIIIIIIIMI 

AACGAATCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 



ATTGAAGCTATTAATCAAATAGTACAAGGGTACAAAGTATATGTAGGTCAGCCAGACAAT 

I II III II HIM II III 1 1 IMIII Ml I III 1 1 MM MM I MM I MM I HIM I 

ATTGAAGCTATTAATCAAATAGTACAAGGGTACAAAGTATATGTAGGTCAGCCAGACAAT 



300 
312 
360 
372 
420 
432 
480 
492 
540 



GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGACAAGAAAGGAGAATATTTA 

1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 M I i 1 1 1 1 1 M I 

GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGACAAGAAAGGAGAATATTTA 552 
ACACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTGGATAAATTACTTTCTTTA 600 

I MM MM II MM II II I I MM MM MM 1 1 MM I MM I III I III II I MM I 

ACACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTGGATAAATTACTTTCTTTA 612 

ATTAAG CAGTATCTTTAA 618 

I I I I I I II I I I II I I II I 
ATTAAG CAGTATCTTTAA 630 



RESULT 5 

COWIMMDOM 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 



DNA linear BCT 06-OCT-1994 
immunodominant protein, complete 



COWIMMDOM 2773 bp 

Cowdria ruminantium (clone pF5.2) 
cds; ORF, complete cds. 
L07385 

L07385.1 Gl: 289922 
immunodominant protein. 

Ehrlichia ruminantium (heartwater rickettsia) 
Ehrlichia ruminantium 

Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsials; 
Anapl asmataceae ; Eh rl i chi a . 
1 (bases 1 to 2773) 



RESULT 4 
AF117727 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



ORIGIN 



AF117727 630 bp DNA linear BCT 26-JUL-2000 

Cowdria rumi nanti um isolate Highway major antigenic protein 2 gene, 
complete cds. 
AF117727 

AF117727.1 Gl: 5163238 

Ehrlichia rumi nanti um (heartwater rickettsia) 
Ehrlichia rumi nanti um 

Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsials; 
Anapl asmataceae ; Ehrl i chi a . 

1 (bases 1 to 630) 

Bowie, M. v., Reddy,G.R., Semu,S.M., Mahan,S.M. and Barbet,A.F. 
Potential value of major antigenic protein 2 for serological 
diagnosis of heartwater and related ehrl i chi al infections 
Clin. Diagn. Lab. Immunol. 6 (2), 209-215 (1999) 
10066656 

2 (bases 1 to 630) 

Bowie, M. v., Reddy,G.R., Semu,S.M., Mahan,S.M. and Barbet,A.F. 
Direct Submission 

Submitted (05-JAN-1999) Pathobiology, University of Florida, 2015 
SW 16th Avenue, Building 1017, Room V2-162, Gainesville, FL 32610, 

USA 

Locat i on/Qual i f i ers 
1. .630 

/organi sm= M Ehrl i chi a rumi nanti um" 
/mo l_type= M genomic dna" 
/isolate= M Highway" 
/db_xref="taxon:779" 
1. .630 

/codon_start=l 
/transl_table=ll 

/product= n major antigenic protein 2" 
/protei n_i d= r, AAD40616 . 1" 
/db_xref="Gl: 5163239" 

/translation= M MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNESDKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLTHFVPDLKSKEPQVDKLLSLIKQYL" 



Alignment Scores: 
Pred. No. : 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
db: 



US-10-722-077-26 (1-205) x AF117727 (1-630) 



4.19e-124 


Length: 


630 


1054.00 


Matches: 


205 


100.0% 


Conservative: 


0 


100.0% 


Mismatches: 


0 


100.0% 


Indels: 


0 


15 


Gaps: 


0 



Qy 

Db 

Qy 

Db 

Qy 

pb 

Qy 
Db 



1 MetLysAl ail eLysPhell eLeuAsnLeuCysLeuLeuPheAl aAl all ePheLeuGl y 20 
I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I 

13 ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I I I I I GGGA 72 

21 TyrSerTyril eThrLysGl nGl yll ePheGl nProLysLeuHi sAspSerProAspval 40 
I I I I I I I I I I I.I I I II II I I I I I I I I I M I I I I I II I II II I I I I I I I I II I M I I I I I I 

73 TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAATTACACGACTCTCCTGATGTT 132 

41 AsnlleSerAsnLysAlaAsplleAsnThrSerPheSerLeulleAsnGlnAspGlylle 60 
I 1 1 1 1 I 1 1 I I I I I 1 1 1 1 I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 1 1 I I I I I I I I 

133 AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATT 192 

61 ThrileSerSerLysAspPheLeuGlyLysHisMetLeuvalLeuPheGlyPheSerSer 80 
111 MINI MINI MM II II i I! Ill IIMIMI I I I I I I MIM M INI I Ml I 

193 ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCCI I I I I GGGTTTTCTTCT 252 



Qy 

Db 

Qy 

ob 

Qy 

Db 
Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

RESULT 5 

COWIMMDOM 

LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 

TITLE 

JOURNAL 
PUBMED 
COMMENT 

FEATURES 

source 



81 CysLysThnl eCysProMetGl uLeuGl yLeuAl aSerThril eLeuAspGl nLeuGl y 
I I I II I I M I M I I I II I II M I I M I I M I I II M I I I I M I I II II M I M I I I I M 

253 TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 



100 
312 
120 



101 AsnGl uSerAspLysLeuGl nval val Phell eThrll eAspProThrLysAspThrval 
MM M M Ml MM MMMM IM MM MIM M MMMMMM MM MMIM 

313 AACGAATCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 372 



121 GluThrLeuLysGluPheHisLysAsnPheAspSerArglleGlnMetLeuThrGlyAsn 140 
M M M M M M M M M M M M M M M M M M MIM M I M M M M M M M M 

373 GAAACACTAAAAGAGTTTCACAAAAATTTTGACTCACGGATTCAAATGTTAACAGGAAAC 432 



141 II eGl uAl ail eAsnGl nil eval Gl nGl yTy rLysVal Ty rval Gl yGl nProAspAsn 
MMIMMM M M MMMMMMMMMMM MMMM MM MMMM M I 

433 ATTGAAGCTATTAATCAAATAGTACAAGGGTACAAAGTATATGTAGGTCAGCCAGACAAT 



160 
492 
180 



161 AspAsnGl nil eAsnHi sSerGl yll eMetTyrll eVal AspLysLysGl yGl uTyrLeu 
MINI III II 1 1 II Mill Mill II 1 1 MM 1 1 MM Ml III II 1 1 II III I Mill 

493 GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGACAAGAAAGGAGAATATTTA 552 



181 ThrHisPhevalProAspLeuLysSerLysGluProGlnvalAspLysLeuLeuSerLeu 200 
I II 1 1 1 1 1 II 1 1 1 1 1 1 1 1 II 1 1 1 1 II M 1 1 1 II I M I I II 1 1 1 1 1 1 1 1 1 II M 1 1 1 1 1 1 1 

553 ACACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTGGATAAATTACTTTCTTTA 612 



201 ileLysGlnTyrLeu 205 
MMMMMIMM 

613 ATT AAG CAGT ATCTT 627 



DNA linear BCT 06-OCT-1994 
immunodominant protein, complete 



CDS 



COWIMMDOM 2773 bp 

Cowdria ruminantium (clone pF5.2) 
cds; ORF , complete cds. 
L07385 

L07385.1 Gl: 289922 
immunodominant protein. 

Ehrlichia ruminantium (heartwater rickettsia) 
Ehrlichia ruminantium 

Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsials ; 
Anaplasmataceae; Ehrlichia. 
1 (bases 1 to 2773) 

Mahan,S.M., McGui re ,T. C. , Semu,S.M., Bowie, M. v., Jongejan , F. , 
Rurangirwa,F.R. and Barbet,A.F. 

Molecular cloning of a gene encoding the immunogenic 21 kDa protein 
of Cowdria ruminantium 

Microbiology 140 (Pt 8), 2135-2142 (1994) 
7921263 

Original source text: Cowdria ruminantium (library: pUCl3 of Suman 
Mahan) DNA. 

Locati on/Qual i f i ers 

1. .2773 

/organi sm="Ehrl i chi a rumi nanti urn" 
/mol_type="genomic dna" 
/db_xref="taxon:779" 
/tissue_lib="pUCl3 of Suman Mahan" 
1245, .1874 

/experiment="experimental evidence, no additional details 

recorded" 

/codon_start=l 

/transl_table=ll 

/product="immunodomi nant protein" 
/protei n_i d="AAA50280 . 1" 
/db_xref="Gl: 289923" 

/translation="MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNESDKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 



sig_peptide 

CDS 



sig_peptide 

ORIGIN 



QPDNDNQINHSGIMYIVDKKGEYLTHFVPDLKSKEPQVDKLLSLIKQYL" 

1245. .1301 

/note="putati ve" 

1943. .2773 

/note="putati ve" 

/codon_start=l 

/transl_table=ll 

/protein_id="AAA50281.1" 

/db_xref="Gl: 289924" 

/trans! ation="MKKFLLVLFLLLVMLPKDSNAEHIHVVGSSTAFPFIAAIAEEFG 
RFSDYGTPIIESVGSGMGFSMFCQSVENSTPDIAMSSRKIKDAEVELCKSNDVHDIIE 
IIIGYDGIVIANSNNSNKLDFTKKDLFKALSKYATSEEYTHSIPVNDFKYWSEINNRF 
PNIDIEVYGPYKNTGTYNILIEEIMQDSCMNHKNFIEVYPDLKKRQHACSMIRNDGKY 
IEVAANENIIIQKIAKNNAAFGIFSFSFLIQNQDKIHGNKIAGVEPTYETISSGKYIL 
S" 

1943. .2005 
/note="putative" 



Alignment Scores: 
Pred. no.: 
Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
DB: 



US-10-722-077-26 (1-205) x COWIMMDOM (1-2773) 



2.31e-123 


Length: 


2773 


1054.00 


Matches: 


205 


100.0% 


Conservative: 


0 


100.0% 


Mismatches: 


0 


100.0% 


indel s : 


0 


15 


Gaps: 


0 



Qy 1 MetLysAlalleLysPhelleLeuAsnLeuCysLeuLeuPheAlaAlallePheLeuGly 20 

I Mill II II I MINI II II II illl MM III I II Mill II I II II MINIMI 1 1 

Db 1257 ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I I I I I GGGA 1316 

Qy 21 TyrSerTyrlleThrLysGlnGlyllePheGlnProLysLeuHisAspSerProAspVal 40 

I II II I II II I II I I II I II I I I II II II II I II I I I II I I I I I I I I I I I I II I I II I I I 

Db 1317 TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAATTACACGACTCTCCTGATGTT 1376 

Qy 41 AsnileSerAsnLysAlaAspileAsnThrSerPheSerLeulleAsnGlnAspGlylle 60 

I II I II II II I 1 1 1 1 II I II I I II I II II I I I II I 1 1 II II II I I 1 1 1 1 1 1 1 1 I I II II I 

Db 1377 AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATT 1436 

Qy 61 ThrlleSerSerLysAspPheLeuGlyLysHisMetLeuvalLeuPheGlyPheSerSer 80 

I II I II II I I I I I II I II II I I II I II II I I I II I 1 1 II II I I Mill I 1 1 II I I I II I I 

Db 1437 ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCC I I I I I GGGTTTTCTTCT 1496 



Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 
Db 

Qy 

Db 

Qy 



81 CysLysThril eCysProMetGl uLeuGlyLeuAl aSerThrll eLeuAspGl nLeuGl y 100 
I II I I I II II I I I II II I II I I II I II II I II II I I I II II II I I II II II II I I II II I 

1497 TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 1556 

101 AsnGl uSerAspLysLeuGl nvalval Phell eThnleAspProThrLysAspThrVal 120 
I II I II II II I I I II III II I I I II II II I II II I I II I II II II II I I I II I II II II I 

1557 AACGAATCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 1616 

121 Gl uThrLeuLysGl uPheHi sLysAsnPheAspSerArgll eGl nMetLeuThrGl yAsn 140 

I II I II II I I I I I II III II I I II I I I II I II II I I I Illl II I I I II I I II I I I I I II I 
1617 GAAACACTAAAAGAGTTTCACAAAAATTTTGACTCACGGATTCAAATGTTAACAGGAAAC 1676 

141 ileGluAlalleAsnGlnlleValGlnGlyTyrLysValTyrvalGlyGlnProAspAsn 160 

MMMMMIMMMIMMMMMIIMMMMIMMIMMMMIMIMM 
1677 ATTGAAGCTATTAATCAAATAGTACAAGGGTACAAAGTATATGTAGGTCAGCCAGACAAT 1736 

161 AspAsnGl nil eAsnHi sSerGl yll eMetTyril eval AspLysLysGlyGl uTyrLeu 180 
I II II II II II I II II I I I I I I I I II II I I I II I I II II II I II I II II II I I I I I II I I 

1737 GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGACAAGAAAGGAGAATATTTA 1796 

181 ThrHi sPheVal ProAspLeuLysSerLysGl uProGl nval AspLysLeuLeuSerLeu 200 
IMIIMI I! I II I Mill Mill III II II Illl III Mill II III Mill Ml II i 



Db 

Qy 

Db 



1797 ACACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTGGATAAATTACTTTCTTTA 1856 

201 IleLysGlnTyrLeu 205 
I II! II II II I I III 

1857 ATTAAGCAGTATCTT 1871 



RESULT 6 
AF117728 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



CDS 



ORIGIN 



AF117728 630 bp DNA linear BCT 26-JUL-2000 

Cowdria ruminantium isolate Palm River major antigenic protein 2 
gene, complete cds. 
AF117728 

AF117728.1 Gl: 5163240 

Ehrlichia ruminantium (heartwater rickettsia) 
Ehrlichia ruminantium 

Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsiales; 
Anapl asmataceae ; Eh rl i chi a . 

1 (bases 1 to 630) 

Bowie,M.v. f Reddy,G.R., Semu,S.M., Mahan,S.M. and Barbet,A.F. 
Potential value of major antigenic protein 2 for serological 
diagnosis of heartwater and related ehrlichial infections 
Clin. Diagn. Lab. Immunol. 6 (2), 209-215 (1999) 
10066656 

2 (bases 1 to 630) 

Bowie,M.V., Reddy.G.R., Semu,S.M., Mahan f S.M. and Barbet,A.F. 
Direct Submission 

Submitted (05-JAN-1999) Pathobiology, University of Florida, 2015 
sw 16th Avenue, Building 1017, Room V2-162, Gainesville, FL 32610, 
USA 

Locati on/Qual i f i er s 
1. .630 

/organi sm="Ehrl i chi a rumi nanti urn" 
/mol_type="genomic dna" 
/isolate= M Palm River" 
/db_xref="taxon:779" 
1. .630 

/codon_start=l 
/transl_table=ll 

/product="maior antigenic protein 2" 
/protei n_i d= AAD40617 . 1" 
/db_xref="Gl: 5163241" 

/translation="MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNEADKLQVVFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLAHFVPDLKSKEPQVDKLLSLIKQYL" 



Alignment Scores: 
Pred. No. : 

Score: 

Percent Similarity: 
Best Local Similarity: 
Query Match: 
db: 



US-10-722-077-26 (1-205) x AF117728 (1-630) 



4.36e-123 


Length: 


630 


1046.00 


Matches: 


203 


99.5% 


Conservative: 


1 


99.0% 


Mismatches : 


1 


99.2% 


indels: 


0 


15 


Gaps: 


0 



Qy 

Db 

Qy 

Db 

Qy 



20 



1 MetLysAlalleLysPhelleLeuAsnLeuCysLeuLeuPheAlaAlailePheLeuGly 
MINIM II I I II I I INI Mill MM I I INI MM I MINI I M ill Ml M ill 

13 ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA I I I I I I I GGGA 72 

21 TyrSerTyrll eThrLysGl nGl yll ePheGl nProLysLeuHi sAspSerProAspval 40 
M M M M M I M M M 1 1 M M M M 1 1 M I II M I M M M 1 1 M M M M M M M I 

73 TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAGTTACACGACTCTCCTGATGTT 132 

41 AsnileserAsnLysAlaAsplleAsnThrserPheSerLeuileAsnGlnAspGlyile 60 



Db 

Qy 

Db 

Qy 

Db 

Qy 
Db 

Qy 
Db 

Qy 
Db 

Qy 

Db 

Qy 

Db 

Qy 

Db 

RESULT 7 
AF117729 
LOCUS 

DEFINITION 

ACCESSION 
VERSION 
KEYWORDS 
SOURCE 
ORGANISM 



REFERENCE 
AUTHORS 
TITLE 

JOURNAL 
PUBMED 
REFERENCE 
AUTHORS 
TITLE 
JOURNAL 



FEATURES 

source 



I I 1 1 I! 1 1 I! I 1 1 I I I I I 1 1 1 I 1 1 1 I 1 1 1 I Ml I 1 1 I I I 1 1 I I 1 1 I I 1 1 I I I I! I 1 1 1 I I 

133 AATATATCGAACAAAGCGGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATC 



192 



61 ThrlleSerSerLysAspPheLeuGlyLysHisMetLeuValLeuPheGlyPheSerSer 80 

i i 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 

193 ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCC I I I I 1 GGGTTTTCTTCT 252 



81 



100 



CysLysTh nl eCysProMetGl uLeuGl yLeuAl ase rTh nl eLeuAspGl nLeuGl y 
I 1 11 M I M 1 1 1 1 ! 1 1 1 1 1 1 1 1 1 1 1 M II I Ml I II 1 1 1 1 1 1 1 Ml INI 1 1 Ml III 1 1 

253 TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 312 



101 AsnGl uSerAspLysLeuGl nval val Pheil eThrll eAspProThrLysAspThrVal 120 
I 1 1 I I I ::: I I I I I I 1 1 I I M 1 1 1 1 I 1 1 1 I I II II 1 1 I 1 1 1 I i M I 1 1 1 I I I 1 1 I 1 1 1 I I 

313 AACGAAGCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 372 



121 GluThrLeuLysGluPheHisLysAsnPheAspSerArglleGlnMetLeuThrGlyAsn 140 
1 1 1 1 1 1 1 1 1 1 1 II 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 [ IT I i 1 1 M 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

373 GAAACACTAAAAGAGTTTCACAAAAATTTTGACTCACGGATTCAAATGTTAACAGGAAAC 432 



141 il eGl uAl ail eAsnGl nil eval Gl nGl yTyrLysvalTy rval Gl yGl nProAspAsn 160 
III III Mill I II M MMMMMMMIMM IMIIMI I MM I Ml II llllll 

433 ATTGAAGCTATTAATCAAATAGTACAAGGGTATAAAGTATATGTAGGTCAACCAGACAAT 492 



161 AspAsnGl nil eAsnHi sSerGl yll eMetTyril eval AspLysLysGl yGl uTyrLeu 
1 1 1 II 1 1 II II 1 1 1 1 MM II MM 1 1 II 1 1 M 1 1 II 1 1 1 1 1 1 II 1 1 II M M I M II 1 1 

493 GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGATAAGAAAGGAGAATATTTA 



180 
552 
200 



181 ThrHisPheval ProAspLeuLysSerLysGluProGlnValAspLysLeuLeuSerLeu 
1 1 M I II I I M 1 1 1 1 M II II I 1 1 1 II I II II 1 1 1 1 Ml I I 1 1 II I I M M I II 1 1 I 

553 GCACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTAGATAAATTAC7TTCTTTA 612 



201 ileLysGlnTyrLeu 205 
I ! 1 1 1 1 1 1 1 1 1 1 1 1 1 

613 ATTAAGCAGTATCTT 627 



CDS 



AF117729 630 bp DNA linear BCT 26-JUL-2000 

Cowdria ruminantium isolate Urn Banein major antigenic protein 2 
gene, complete cds. 
AF117729 

AF117729.1 Gl: 5163242 

Ehrlichia ruminantium (heartwater rickettsia) 
Ehrlichia ruminantium 

Bacteria; Proteobacteria; Alphaproteobacteria; Rickettsiales ; 
Anapl asmataceae ; Eh rl i chi a . 

1 (bases 1 to 630) 

Bowie.M.v., Reddy f G.R., Semu.s.M., Mahan,S.M. and Barbet,A.F. 
Potential value of major antigenic protein 2 for serological 
diagnosis of heartwater and related ehrlichial infections 
Clin. Diagn. Lab. Immunol. 6 (2), 209-215 (1999) 
10066656 

2 (bases 1 to 630) 

Bowie,M.v. f Reddy,G.R., Semu,S.M., Mahan,S.M. and Barbet,A.F. 
Direct Submission 

submitted (05-JAN-1999) Pathobiology , University of Florida, 2015 
sw 16th Avenue, Building 1017, Room V2-162, Gainesville, FL 32610, 
USA 

Locati on/Qual i f i ers 
1. .630 

/organi sm= M Ehrl i chi a rumi nanti urn" 
/mol_type="genomic DNA" 
/isolate="um Banein" 
/db_xref="taxon:779" 
1. .630 

/codon_start=l 



/transl_table=ll 

/product= ,, major antigenic protein 2" 
/protei n_i d=* AAD40618 . 1" 
/db_xref="GI: 5163243" 

/trans! ation="MEHIMKAIKFILNLCLLFAAIFLGYSYITKQGIFQPKLHDSPDV 
NISNKADINTSFSLINQDGITISSKDFLGKHMLVLFGFSSCKTICPMELGLASTILDQ 
LGNEADKLQWFITIDPTKDTVETLKEFHKNFDSRIQMLTGNIEAINQIVQGYKVYVG 
QPDNDNQINHSGIMYIVDKKGEYLAHFVPDLKSKEPQVDKLLSLIKQYL" 

ORIGIN 

Alignment Scores: 

Pred. No.: 4.36e-123 Length: 630 

Score: 1046.00 Matches: 203 

Percent Similarity: 99.5% Conservative: 1 

Best Local Similarity: 99.0% Mismatches: 1 

Query Match: 99.2% Indels: 0 

DB: 15 Gaps: 0 



us-io- 


722-077- 


-26 (1-205) x AF117729 (1-630) 




Qy 


1 


MetLysAl ail eLysPheil eLeuAsnLeuCysLeuLeuPheAl aAl all ePheLeuGl y 
IIIIIIIMIIIIIIIIIIIIIIIillllllllllllllllllMIIIIIIIIMIIIII 

ATGAAGGCTATCAAGTTTATACTTAATCTATGTTTACTATTTGCAGCAA 1 1 1 1 1 1 1 GGGA 


20 


Db 


13 


72 


Qy 
Db 


21 
73 


TyrSerTyrileThrLysGlnGlyilePheGlnProLysLeuHi sAspSerProAspVal 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
TATTCTTACATAACAAAACAAGGTATATTCCAACCAAAGTTACACGACTCTCCTGATGTT 


40 
132 


Qy 

Db 


41 
133 


Asnll eSerAsnLysAl aAspll eAsnThrSerPheSerLeull eAsnGl nAspGl yll e 
1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 
AATATATCGAACAAAGCTGATATAAATACTAGCTTTAGCTTAATTAATCAGGATGGTATC 


60 
192 


Qy 
Db 


61 
193 


ThrileSerSerLysAspPheLeuGlyLysHisMetLeuValLeuPheGlyPheSerSer 

llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 
ACGATATCTAGTAAAGACTTCCTTGGAAAACATATGTTAGTCCI 1 1 1 1 GGG 1 1 1 1 C 1 1 C 1 


80 
252 


Qy 
Db 


81 
253 


CysLysThril eCysProMetGl uLeuGl yLeuAl aSerThril eLeuAspGl nLeuGl y 
llllllllllllllllllllllllllllllllllllllllllllllllllllllllllll 

TGTAAAACTATTTGCCCCATGGAACTAGGGTTAGCATCCACAATTCTAGATCAACTTGGC 


100 
312 


Qy 

Db 


101 
313 


AsnGl uSerAspLysLeuGl nval Val Phell eThrll eAspProThrLysAspThrVal 

1 1 1 1 1 1 ::: 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 
AACGAAGCTGACAAGTTACAAGTAGTCTTTATAACTATTGATCCAACAAAAGATACTGTA 


120 
372 


Qy 


121 


GluThrLeuLysGluPheHisLysAsnPheAspSerArgileGl nMetLeuThrGlyAsn 
1 Ml IMIMI MM MMMMMMMMMIMlIMM Mill Mill Mill 

GAAACACTAAAAGAGTTTCACAAAAA 1 1 1 1 GACTCACGGATTCAAATGTTAACAGGAAAC 


140 


Db 


373 


432 


Qy 

Db 


141 
433 


II eGl uAl all eAsnGl nil eval Gl nGl yTyrLysval Ty rval Gl yGl nProAspAsn 
1 1 M 1 1 M 1 M 1 M M 1 M M 1 M 1 M M 1 M M 1 1 1 M M 1 M M M 1 1 M M 1 M M 1 

ATTGAAGCTATTAATCAAATAGTACAAGGGTATAAAGTATATGTAGGTCAACCAGACAAT 


160 
492 


Qy 


161 


AspAsnGl nil eAsnHi sSerGl yll eMetTyril eval AspLysLysGl yGl uTy rLeu 
1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 i 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 11 1 1 1 1 M 

GATAACCAAATTAACCATTCTGGAATAATGTATATTGTAGATAAGAAAGGAGAATATTTA 


180 


Db 


493 


552 


Qy 


181 


ThrHi sPheValProAspLeuLysSerLysGluProGlnvalAspLysLeuLeuSerLeu 
M 1 1 1 1 M 1 M M 1 M 1 M M 1 1 1 M 1 M M 1 M M 1 M M 1 M M 1 M M 1 M M 1 

GCACATTTTGTACCAGATTTAAAGTCAAAAGAGCCTCAAGTAGATAAATTACTTTCTTTA 


200 


Db 


553 


612 


Qy 


201 


ileLysGlnTyrLeu 205 
1 1 M M 1 1 1 M 1 M 1 

ATTAAGCAGTATCTT 627 




Db 


613 





